The multinational Arabidopsis steering subcommittee for proteomics assembles the largest proteome database resource for plant systems biology.
نویسندگان
چکیده
In the past 10 years, we have witnessed remarkable advances in the field of plant molecular biology. The rapid development of proteomic technologies and the speed with which these techniques have been applied to the field have altered our perception of how we can analyze proteins in complex systems. At nearly the same time, the availability of the complete genome for the model plant Arabidopsis thaliana was released; this effort provides an unsurpassed resource for the identification of proteins when researchers use MS to analyze plant samples. Recognizing the growth in this area, the Multinational Arabidopsis Steering Committee (MASC) established a subcommittee for A. thaliana proteomics in 2006 with the objective of consolidating databases, technique standards, and experimentally validated candidate genes and functions. Since the establishment of the Multinational Arabidopsis Steering Subcommittee for Proteomics (MASCP), many new approaches and resources have become available. Recently, the subcommittee established a webpage to consolidate this information (www.masc-proteomics.org). It includes links to plant proteomic databases, general information about proteomic techniques, meeting information, a summary of proteomic standards, and other relevant resources. Altogether, this website provides a useful resource for the Arabidopsis proteomics community. In the future, the website will host discussions and investigate the cross-linking of databases. The subcommittee members have extensive experience in Arabidopsis proteomics and collectively have produced some of the most extensive proteomics data sets for this model plant (Table S1 in the Supporting Information has a list of resources). The largest collection of proteomics data from a single study in A. thaliana was assembled into an accessible database (AtProteome; http://fgcz-atproteome.unizh.ch/index.php) and was recently published by the Baginsky lab. The database provides links to major Arabidopsis online resources, and raw data have been deposited in PRIDE and PRIDE BioMart. Included in this database is an Arabidopsis proteome map that provides evidence for the expression of ∼50% of all predicted gene models, including several alternative gene models that are not represented in The Arabidopsis Information Resource (TAIR) protein database. A set of organ-specific biomarkers is provided, as well as organ-specific proteotypic peptides for 4105 proteins that can be used to facilitate targeted quantitative proteomic surveys. In the future, the AtProteome database will be linked to additional existing resources developed by MASCP members, such as PPDB, ProMEX, and SUBA. The most comprehensive study on the Arabidopsis chloroplast proteome, which includes information on chloroplast sorting signals, posttranslational modifications (PTMs), and protein abundances (analyzed by high-accuracy MS [Orbitrap]), was recently published by the van Wijk lab. These and previous data are available via the plant proteome database (PPDB; http://ppdb.tc.cornell.edu) for A. thaliana and maize. PPDB provides genomewide experimental and functional characterization of the A. thaliana and maize proteomes, including PTMs and subcellular localization information, with an emphasis on leaf and plastid proteins. Maize and Arabidopsis proteome entries are directly linked via internal BLAST alignments within PPDB. Direct links for each protein to TAIR, SUBA, ProMEX, and other resources are also provided. A comprehensive database on the subcellular localization of Arabidopsis proteins was extensively updated in 2007 (SUBA; www.suba.bcs.uwa.edu.au) by the Millar lab. These data are linked to several other databases, and selected data are provided as web services via the BioMoby Dashboard. The database houses relational data with localization information from subcellular proteome studies, fluorescent protein targeting studies, AmiGO and UniProt information, as well as several bioinformatics prediction programs. The recent use of high-throughput techniques for phosphoproteomic analyses led the Heazlewood, Weckwerth, and Schulze labs to establish a database of phosphorylation sites in A. thaliana (PhosPhAt; http://phosphat.mpimp-golm.mpg.de) earlier this year. The database also incorporates a phosphoserine prediction algorithm that has been used to precompute phosphoserine sites across all TAIR gene models. A central, searchable database of MS/MS reference spectra (mostly Orbitrap accurate precursor ion mass data) derived from A. thaliana, Chlamydomonas reinhardtii, Medicago truncatula, potato, tomato, and other plants (ProMEX; http:// promex.mpimp-golm.mpg.de/home.shtml) was established last year by the Weckwerth lab. The database provides an interface to match newly generated MS/MS spectra against previously derived experimental MS/MS spectra; this process results in high-fidelity matching between real spectra and the identification of poor and unmatched spectra. In addition, the spectral count for each identified protein is exported into the search result table for semiquantitative analysis. The ProMEX database serves as a design tool for proteotypic peptides that can be used for targeted, accurate protein quantification in complex samples. Unidentified spectra of good S/N quality can be included in the ProMEX database; such spectra can facilitate the assignment of unknown spectra. The database cross-references the UniProt plant genome annotation initiative and other resources. A genome browser (AnnoJ Web 2.0; http://neomorph.salk.edu/ epigenome.html) developed by the Millar lab will be a tool to watch for in the future. It is currently being used for displaying
منابع مشابه
Arabidopsis leaf plasma membrane proteome using a gel free method: Focus on receptor–like kinases
The hydrophobic proteins of plant plasma membrane still remain largely unknown. For example in the Arabidopsis genome, receptor-like kinases (RLKs) are plasma membrane proteins, functioning as the primary receptors in the signaling of stress conditions, hormones and the presence of pathogens form a diverse family of over 610 genes. A limited number of these proteins have appeard in pr...
متن کاملPPDB, the Plant Proteomics Database at Cornell
The Plant Proteomics Database (PPDB; http://ppdb.tc.cornell.edu), launched in 2004, provides an integrated resource for experimentally identified proteins in Arabidopsis and maize (Zea mays). Internal BLAST alignments link maize and Arabidopsis information. Experimental identification is based on in-house mass spectrometry (MS) of cell type-specific proteomes (maize), or specific subcellular pr...
متن کاملManaging the green proteomes for the next decade of plant research
For the past decade the field of proteomics has transitioned from a highly specialized research area into a conventional technique widely employed by plant biologists. This approach now encompasses basic protein identification to advanced comparative studies. The result has been an abundance of proteomics data, often not readily available to the research community (Heazlewood, 2011). This has r...
متن کاملArabidopsis nucleolar protein database (AtNoPDB)
The Arabidopsis Nucleolar Protein Database (http://bioinf.scri.sari.ac.uk/cgi-bin/atnopdb/home) provides information on 217 proteins identified in a proteomic analysis of nucleoli isolated from Arabidopsis cell culture. The database is organized on the basis of the Arabidopsis gene identifier number. The information provided includes protein description, protein class, whether or not the plant ...
متن کاملEvaluation of Leaf Proteome in Wheat Genotypes Under Drought Stress
Drought stress in plants, the change (increase or decrease) in the production of plant proteins. Proteomics in recent years one of the most powerful tools that help us to study the changes in protein In order to investigate the proteome of wheat leaves in response to terminal drought, two genotypes susceptible and resistant wheat genotypes were evaluated under irrigated (non-stress) and rain-fe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of proteome research
دوره 7 10 شماره
صفحات -
تاریخ انتشار 2008